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ABSTRACT To help define the biological functions of nonessential genes of Francisella novicida, we measured the growth of ar- 
rayed members of a comprehensive transposon mutant library under a variety of nutrition and stress conditions. Mutant pheno- 
types were identified for 37% of the genes, corresponding to ten carbon source utilization pathways, nine amino acid- and 
nucleotide-biosynthetic pathways, ten intrinsic antibiotic resistance traits, and six other stress resistance traits. The greatest sur- 
prise of the analysis was the large number of genotype-phenotype relationships that were not predictable from studies of Esche- 
richia coli and other model species. The study identified candidate genes for a missing glycolysis function (phosphofructoki- 
nase), an unusual proline-biosynthetic pathway, parallel outer membrane lipid asymmetry maintenance systems, and novel 
antibiotic resistance functions. The analysis provides an evaluation of annotation predictions, identifies cases in which funda- 
mental processes differ from those in model species, and helps create an empirical foundation for understanding virulence and 
other complex processes. 

IMPORTANCE The value of genome sequences as foundations for analyzing complex traits in nonmodel organisms is limited by 
the need to rely almost exclusively on sequence similarities to predict gene functions in annotations. Many genes cannot be as- 
signed functions, and some predictions are incorrect or incomplete. Due to these limitations, genome-scale experimental ap- 
proaches that test and extend bioinformatics-based predictions are sorely needed. In this study, we describe such an approach 
based on phenotypic analysis of a comprehensive, sequence-defined transposon mutant library. 
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The value of a genome sequence as a basis for understanding 
complex activities of an organism depends on the accuracy 
and completeness of its annotation. Function predictions in ge- 
nome annotations are based largely on identifying related se- 
quences characterized experimentally in model organisms. While 
such predictions can be highly informative, sequence relatedness 
generally identifies molecular activities much better than biolog- 
ical functions (1, 2). The fraction of genes in new genome se- 
quences which cannot be assigned functions at all also remains 
high, typically around 30% (3). There is thus a great need for new 
approaches to supplement sequence-based prediction methods 
for assigning biological functions in genome annotations. 

In this study, we used large-scale phenotyping to assign func- 
tions to genes of a nonmodel prokaryote, Francisella novicida. 
Large-scale phenotyping has largely been restricted to model or- 
ganisms for which libraries of deletion-insertion mutants have 
long been available. For example, chemogenomic profiling of Sac- 
charomyces cerevisiae using highly sensitive growth assays detected 
phenotypes for most genes (4), and studies of Escherichia coli de- 
fined numerous antibiotic resistance, other inhibitor resistance, 
and carbon source utilization genes (5-8). Large-scale phenotyp- 
ing has recently been extended to two nonmodel species through 
the analysis of sequence-defined libraries of transposon mutants 
(9-11). One study examined Shewanella oneidensis and identified 
functions for 40 underannotated genes (11). A second study ex- 



amined Pseudomonas aeruginosa but did not provide specific 
genotype-phenotype associations (10) 

F. novicida is a low-virulence surrogate of F. tularensis, one 
of the most infectious pathogens known (12-14). F. novicida is 
ideal for genomic studies of virulence and other processes 
because it has a small genome (1.9 Mbp) and may be readily 
manipulated ( 15, 16). In addition, a manual genome annotation is 
available (http://go.francisella.org/cgi-bin/frangb/elementlist 
.cgi?gac=Francisella_tularensis_novicida_U112) (17), and a 
near-saturation, sequence-defined transposon mutant library has 
been created (18). In this study, we sought to evaluate genome 
annotation predictions by examining growth of the members of 
the library under a variety of nutritional and stress conditions. The 
results provide direct tests of annotation assignments and specify 
numerous unexpected genotype-phenotype associations. 

RESULTS 

F. novicida phenotyping was carried out using an arrayed, 
sequence-defined transposon mutant library. Growth of the 
members of the library on a variety of agar media was quantified 
using image processing. The growth conditions examined were 
chosen to help define the genetic basis of nutrition and resistance 
traits potentially important in both the process of Francisella in- 
fection and in its response to antimicrobial therapy. For broadly 
distributed traits that have been characterized in other bacteria 
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FIG 1 Identification of the F. novicida phosphofructokinase gene. (A) Glycolysis pathway steps. F. novicida gene assignments are derived from the sequence 
annotation, and results are presented here. (B) Growth of mutants on different carbon sources. The gluconeogenesis mutant (gplX) is selectively defective for 
growth on short-chain carbon sources, whereas the aldolase mutant (fbaA), defective in both glycolysis and gluconeogenesis, is defective for growth on both 
sugars and short-chain carbon sources. Mutations in FTN_1210 lead to defects in growth on sugars, suggesting a glycolysis defect. NAG, iV-acetylglucosamine. 
(C) Complementation of an E. coli phosphofructokinase mutant with FTN_1210. The pronounced growth defect of an E. colt pfk mutant (JW3887) on sorbitol 
as the carbon source (40) is complemented by expression of FTN_1210, supporting its assignment as the pfk gene. FTN_1210 also complemented an E. coli pfk 
mutant growth defect on glucose (not shown). 



(such as intrinsic antibiotic resistance), the tests also aimed to 
identify conserved elements. 

Three-allele transposon mutant library. We sought to ana- 
lyze multiple independent mutants for each gene to provide con- 
firmation of genotype-phenotype assignments. This redundancy 
should minimize erroneous assignments due to allele-specific ef- 
fects, unlinked mutations, and cross-contamination. We there- 
fore assembled a mutant library made up of an average of three 
different insertion mutants per predicted gene by supplementing a 
previously assembled two-allele library with mutants from a larger 
primary library (18). The new three-allele library contains a total 
of 4,571 mutants, with 1,456 of 1,767 genes represented. The li- 



brary includes strains with mutations in virtually all protein- 
coding genes nonessential for growth on nutrient medium (Tryp- 
ticase soy agar plus cysteine). The genomic locations of nearly all 
(97.5%) of the transposon insertions in the library were confirmed 
by at least two independent sequencings. The number of different 
insertions per gene areO (311 genes), 1 (133 genes), 2 (177 genes), 
3 (748 genes), 4 (289 genes), and >5 (109 genes). There are 111 
predicted insertions in intergenic regions. The 311 unrepresented 
genes are presumably essential. 

Mutant phenotyping. We examined 38 phenotypes correspond- 
ing to nutritional and stress resistance traits, including antibiotic re- 
sistance (Materials and Methods). The growth yields of mutants of 
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the three-allele library were assayed after spotting on agar medium in 
a 384-well format and incubation for 24 to 48 h. Plates were imaged 
under dark-field illumination using high-resolution digital photog- 
raphy, and the images were processed to quantify growth of each 
mutant spot. Duplicate tests of each condition were carried out, and 
results for individual mutant alleles were averaged. A growth differ- 
ence from the norm was considered significant if the probability of its 
arising by chance was less than 0.01. 

A total of 4,445 mutants yielded reproducible data (see Data 
Set SI in the supplemental material), with 37% (545/1,456) of the 
genes analyzed exhibiting mutant phenotypes (see Data Set S2). 
Most of the genotype-phenotype associations (82%) were con- 
firmed for multiple mutations in a given gene. The remainder 
(18%) corresponded to genes for which only one allele was repre- 
sented in the library. (Cases in which only one of multiple alleles 
led to a phenotype were not included in Data Set S2.) Approxi- 
mately 23% (123/545) of the genes with mutant phenotypes were 
annotated as hypothetical genes and other genes of unknown 
function, 19% (103/545) were assigned general functions only 
(e.g., ABC transporter, ATP-binding protein) and 58% (319/545) 
were assigned specific functions (e.g., recombinase A protein). 

Carbon source utilization. We examined growth of members 
of the three-allele library on thirteen carbon sources, including 
sugars and short-chain compounds. The choice of carbon sources 
was guided by the results of a phenotypic microarray screen of the 
wild-type strain (19). 

The genome annotation predicted complete glycolytic and glu- 
coneogenic pathways, with the exception of phosphofructokinase. 
Mutations inactivating two steps of the pathways (fructose 1,6- 
bisphosphatase [gplX] and fructose 1,6-bisphosphate aldolase 
[fbaA] ) were represented in the library and led to pleiotropic phe- 
notypes compatible with their roles (Fig. 1A and B). Mutations in 
a third gene (FTN_1210) prevented growth on sugars but not on 
short-chain carbon sources, suggesting a block in glycolysis. Since 
the FTN_1210 product shows sequence homology to sugar ki- 
nases (and was originally annotated as a ribokinase), the pheno- 
type suggests that the gene encodes the missing phosphofructoki- 
nase. 

To test this assignment, we examined whether cloned FTN_1210 
could complement an E. coli phosphofructokinase mutant. The Fran- 
cisella gene did indeed correct the mutant E. coli carbon source utili- 
zation defect (Fig. 1C), providing strong support for assignment of 
FTN_1210 as the phosphofructokinase gene (pfk). 

The assays also identified catabolic pathways for ten com- 
pounds (Table 1), seven of which had been fully predicted in the 
sequence annotation. For the eighth (AT-acetylglucosamine), the 
results suggest a possible pathway for which three of four steps 
were new (see Fig. SI in the supplemental material), for the ninth 
(mannose), a required kinase was identified that also acts on fruc- 
tose (see Fig. S2), and for the tenth (fructose), a transporter (FruP) 
and an isomerization function (ManA) were identified. 

For many of the carbon sources, there were mutant assign- 
ments that could not be readily fitted into the predicted pathways 
(Table 1; also, see Table SI in the supplemental material). For 
example, mutations inactivating a predicted quinate/shikimate 
dehydrogenase (FTN_0511) led to a relatively specific defect in 
fructose utilization, and mutations inactivating a predicted 
glycogen-biosynthetic function (glgB) blocked growth on multi- 
ple sugars. 
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a The numbers of genes associated with different nutritional defects are shown 
according to whether or not they were predicted by annotation. Specific genes are listed 
in Table SI in the supplemental material 
b Complete pathway identified. 



Amino acid and nucleotide biosynthesis. F. novicida is natu- 
rally auxotrophic for four amino acids (arginine, histidine, methi- 
onine, and cysteine) but can synthesize the remaining amino acids 
and nucleotides. Mutant library analysis identified most or all 
steps of the pathways for the synthesis of leucine, isoleucine/va- 
line, threonine, lysine, phenylalanine/tyrosine/tryptophan, pu- 
rines, pyrimidines, and pantothenate, suggesting several new as- 
signments in the process (Table 1; also, see Table SI in the 
supplemental material). 

Proline presented an unusual case. Although F. novicida lacks 
the proline-biosynthetic pathway (ProABC) found in E. coli 
(Fig. 2A, left), the organism could grow in the absence of proline if 
a substitute source of a-ketoglutarate (such as its transamination 
product glutamate) was provided, presumably as a source of car- 
bon for the tricarboxylic acid cycle. Four findings implied that F. 
novicida encodes a pathway for generating proline from arginine 
(Fig. 2A, right). First, a screen for mutants requiring proline iden- 
tified two genes (FTN_0854 and FTN_1444) with predicted enzy- 
matic functions (aminotransferase and ornithine cyclodeami- 
nase) compatible with the pathway (Fig. 2B). Second, the proline 
requirement of FTN_0854 (aad) mutants could be satisfied by 
ornithine (Fig. 2B). Third, mutations inactivating a proChomolog 
(FTN_1467) did not lead to proline auxotrophy (Fig. 2B). Finally, 
the FTN_0854 (aad) and FTN_1444 genes (ocd) complemented 
the proline requirement of an E. coli proAB deletion mutant 
(Fig. 2C). Both genes were necessary. The F. novicida proline- 
biosynthetic pathway resembles the arginine catabolic pathway 
encoded by the Agrobacterium tumefaciens Ti plasmid (20) but 
generates ornithine from arginine by means of an amidinotrans- 
ferase rather than an arginase step. 
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FIG 2 Proline biosynthesis. (A) Proline-biosynthetic pathway. (Left) Pathway found in E. coli; (right) proposed pathway for F. novicida. P-5-C, 1-pyrroline- 
5-carboxylate; aad, arginine amidinotransferase gene (amidino group [-CNHNH2] acceptor unknown); ocd, ornithine cyclodeaminase gene. (B) F. novicida 
proline auxotrophs. Mutations in aad and ocd genes prevented growth on medium lacking proline (FNDM supplemented with glutamate) (see Text SI in the 
supplemental material). As predicted, aad mutants grew without proline in medium containing ornithine. Mutations in aproChomolog (FTN_1467) did not 
reduce growth on medium lacking proline. (C) Complementation of an E. coli proline auxotroph. Expression of aad and ocd in E. coli complemented a proAB 
deletion mutant (M8820) for growth in the absence of proline. 



Mutants defective in early steps of the lysine -biosynthetic path- 
way (dapABDE) did not require diaminopimelate (DAP), even 
though DAP is the immediate precursor of lysine and is normally 
essential. Furthermore, genes encoding the DAP-lysine- 
biosynthetic pathway are absent from highly virulent Francisella 
genomes (21). The results raise the possibility that DAP is not 
required for peptidoglycan biosynthesis in Francisella. 

Resistance to multiple antibiotics. Mutations in several genes 
increased sensitivity to multiple antibiotics and other agents (see 
Fig. S3 in the supplemental material). Mutants with related sensitivity 
patterns affected genes for the AcrAB efflux pump (FTN_1609- 1610) 
(22), and a TolC homolog (FTN_1703) (23), suggesting that they 
may function together. Two additional potential efflux transporters 



were also identified (FTN_0570 and FTN_1 180-1 181), as was a po- 
tential rhodanese family sulfurtransferase (FTN_0120). 

Resistance to quinolone antibiotics. The quinolone antibiot- 
ics ciprofloxacin and nalidixic acid inhibit DNA gyrase, leading to 
double-strand DNA breaks. Accordingly, many of the mutations 
increasing sensitivity to these antibiotics affect DNA recombina- 
tion and repair functions, including recABCD and ruvABC (see 
Fig. S4 in the supplemental material). Homologs of many of the F. 
novicida resistance functions also contribute to quinolone resis- 
tance in F. coli (5, 6). Novel functions include a replication heli- 
case {rep) and exonuclease I (sbcB). 

Resistance to aminoglycoside antibiotics. Aminoglycoside 
antibiotics, such as gentamicin, target the 3 OS ribosome subunit to 
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FIG 3 Aminoglycoside hypersensitive mutants. Mutations leading to increased streptomycin and gentamicin sensitivity are shown. Shadings reflect average 
growth of the two alleles with the strongest defects for each gene, and nutritional phenotypes are not shown. Asterisks indicate cases in which polar effects on 
expression of downstream genes could contribute to phenotypes. Several genes whose inactivation led to pleiotropic phenotypes are not included (plsQ glnA, 
FTN_0802, and FTN_0917). The number of mutant alleles for each gene leading to increased sensitivity is provided. Functions associated with intrinsic 
aminoglycoside resistance in E. coli or P. aeruginosa are indicated (5, 6, 25, 27). 



enhance misreading, resulting in aberrant proteins that compro- 
mise membrane integrity (24). Mutations in four genes associated 
with proteolysis (hslVU,yccA, and sohB) enhance aminoglycoside 
sensitivity (Fig. 3), consistent with previous results suggesting that 
proteolysis of mistranslation products maybe protective (25). The 
E. coli homolog of one of the functions (YccA) facilitates the de- 
struction of secretion complexes "jammed" by proteins unable to 
be secreted (26). Mistranslation products unable to be secreted 
could be eliminated in an analogous manner. Mutations reducing 
export capacity (such as in secG and yidD) could increase the sen- 
sitivity of cells to such jamming by reducing the number of func- 
tional secretion complexes. Many aminoglycoside sensitive mu- 
tants were pleiotropic, with increased sensitivities to conditions 
such as elevated temperature and pH, presumably as a conse- 
quence of elevated sensitivity to mistranslation products even in 
the absence of antibiotic. 

About half of the genes contributing to aminoglycoside resis- 
tance in F. novicida have been associated with intrinsic aminogly- 
coside resistance in E. coli or P. aeruginosa (5, 6, 25, 27). The result 
suggests that the mechanisms responsible for intrinsic aminogly- 
coside resistance are at least partially conserved in the three spe- 
cies. 

Resistance to SDS. Mutations in three clusters of predicted 
transport genes enhanced sensitivity to sodium dodecyl sulfate 
(SDS) (Fig. 4A). Two of the clusters corresponded to an E. coli 
ABC transport system (mlaABCDEF) that helps maintain outer 
membrane phospholipid asymmetry (28), and a third (emrABC) 
encodes a putative efflux pump similar to one in E. coli that con- 
fers resistance to hydrophobic compounds (29). 

Two F. novicida homologs are represented for three of the mla 
genes (mlaDEF), indicating that there are two independent path- 
ways. If the two Mla pathways function independently, a double 
mutant in which both are inactive should be more SDS sensitive 



than either parental single mutant. Indeed, the growth of the dou- 
ble mutants was completely inhibited at a concentration of SDS on 
which the parent single mutants grew well (Fig. 4B). 

DISCUSSION 

The primary goal of this study was to help assign biological func- 
tions to a significant number of F. novicida genes through large- 
scale phenotyping. Assigning biological functions experimentally 
complements the use of sequence similarity to predict function 
and should strengthen the foundation for the genetic dissection of 
complex traits. This "phenotypic annotation" should contribute 
to an understanding of Francisella virulence, which requires met- 
abolic and stress resistance adaptations controlled by hundreds of 
genes (30-35). 

Our analysis employed a library of arrayed, sequence-defined 
transposon mutants with insertions in nearly all nonessential 
genes. On average, three mutants for each gene were assayed to 
provide confirmation of genotype-phenotype assignments and to 
minimize missed assignments. It was crucial to incorporate this 
redundancy into the analysis, since a significant number of indi- 
vidual mutants exhibited unrepresentative phenotypes. A total of 
39 nutritional and stress sensitivity phenotypes were examined 
using image analysis to quantify growth of individual strains. 

There are several limitations in using the approach presented 
here to define gene function. First, transposon insertion muta- 
tions can cause polar effects on gene expression. In this study, 
cases in which a phenotype maybe due to polarity can generally be 
recognized as adjacent genes in the same orientation with overlap- 
ping mutant phenotypes. Second, the relationship between a mu- 
tation and a phenotype can be indirect. Third, the absence of a 
phenotype for a mutation may be due to redundant functions, 
which mask the effects of the genetic defect. One advantage of F. 
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FIG 4 SDS resistance genes. (A) Three clusters of transport genes contributing to SDS resistance are shown. Genes in which mutations led to SDS sensitivity are 
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novicida as a test organism is that its genome is small, which pre- 
sumably limits the extent of such redundancy. 

Mutant phenotypes were identified for 37% of the nonessential 
genes of the organism. The genes ranged from those with "com- 
plete" annotation predictions (i.e., including both molecular and 
biological activities) to those lacking predicted functions alto- 
gether. The genotype-phenotype associations we identified thus 
both provide tests of annotation predictions and help specify the 
functions of underannotated genes. 

The screen identified the genes responsible for ten carbon 
source utilization pathways, nine amino acid and nucleotide bio- 
synthesis pathways, seven intrinsic antibiotic resistance and six 
additional stress resistance networks. For most of the phenotypes, 
there were unexpected gene associations, and for some, the ma- 
jority of assignments were new. We assume that the large number 
of new assignments reflects both physiological novelty in F. novi- 
cida and incomplete characterization of some of the traits in 
model species. 

Although nearly every phenotype we examined presented 
novel elements, there were several particularly noteworthy find- 
ings, (i) The identification of a putative phosphofructokinase in F. 
novicida completes the glycolytic pathway. Homologs are found in 
highly virulent Francisella strains (17). (ii) Proline biosynthesis in 
F. novicida is carried out using a two-step pathway that employs 
the essential amino acid arginine as precursor. The pathway ap- 
pears to be nonfunctional in highly virulent Francisella strains in 
which both genes carry inactivating mutations (17). (hi) Diamin- 



opimelic acid may not be required for peptidoglycan synthesis, 
(iv) Three potential drug efflux functions were identified, two of 
which were new. (v) A subset of quinolone and aminoglycoside 
intrinsic resistance functions of F. novicida is shared with E. coli 
and/or P. aeruginosa (5, 25). (vi) There appear to be two Mla 
transport systems that eliminate phospholipids from the outer 
leaflet of the outer membrane and confer detergent resistance. The 
redundancy may compensate for the apparent absence of an outer 
membrane phospholipase Al (28). 

Although our analysis was restricted to basic growth pheno- 
types, the data set of results includes associations that may help in 
understanding more complex traits, such as virulence. For exam- 
ple, Weiss et al. identified 164 F. novicida genes required for sur- 
vival or growth in mice (32). About half of these genes were asso- 
ciated with growth phenotypes in our study, including 33 genes 
with undefined or incompletely defined functional annotations 
(data not shown). Among the functionally undefined genes were 
ones whose inactivation led to enhanced sensitivity to tempera- 
ture (FTN_0599), pH (FTN_1582), and detergent (FTN_1548), as 
well as carbon source utilization defects (FTN_1016 and 
FTN_1656). Several other screens for virulence genes provide ad- 
ditional examples (30, 31, 33) (data not shown). The phenotypes 
represent potential starting points for further analysis of the roles 
of these genes in virulence. 

This study shows that large-scale phenotypic analysis can pro- 
vide a direct evaluation of genome annotation predictions and 
serve as a source of discovery even for well-characterized pro- 
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FIG 5 Analysis of growth phenotypes. (A) Growth of mutant cells. The growth of mutant spots in 384-well array format on one of the test media (FNDM- 
glucose) after 24 h of incubation is shown. Mutants exhibiting reduced growth are evident. (B) Distribution of mutant growth yields. A histogram representing 
growth of the three-allele mutant library on FNDM-glucose is shown. A tail to the left of mutants with reduced growth is evident. A normal distribution curve 
fitted to the peak centered at zero is shown. 



cesses. The recent development of deep-sequencing methods for 
tracking transposon mutants in large populations should comple- 
ment the approach presented here and make it possible to extend 
genome-scale phenotyping to prokaryotes for which defined mu- 
tant libraries are not yet available (27, 36-39). 

MATERIALS AND METHODS 

Growth and imaging of F. novicida mutants. F. novicida mutant library 
plates were thawed and aliquots deposited onto agar test plates using a 
QPIX robot (Genetix) fitted with a floating head replicator (V &P Scien- 
tific AFIXP96FP6). The replicator deposits about 1 microliter per strain, 
corresponding to about 5 X 10 5 cells. The plates were incubated for 48 h at 
37°C (with the exception of the 46°C and SDS plates, which were incu- 
bated for 24 h) to allow growth of bacteria. Following incubation, a digital 
image of each plate was created using a Nikon D70 camera (28-mm lens) 
mounted on a dark field photographic assembly built for this project. In 
this setup, test plates were illuminated indirectly from below. Bacterial 
spots scatter more light than the surrounding agar, making them brighter 
than the agar background in the resulting images. The intensity of spots 
measured from the images as described below was approximately linear 
with cell number, as measured by viable counts (not shown). An image of 
a representative plate of mutant spots illustrates the range of different 
growth yields observed (Fig. 5A). 

Data analysis. The growth of bacterial spots was quantified from high- 
resolution images (see the supplemental material). Analysis of data was 
carried out in several steps. First, growth values for each mutant from 
independent assays (two or more per condition) were averaged. Second, 
mutants growing poorly on unsupplemented TSAC medium (values of 
<0.4) were eliminated (69 strains total). Third, the growth values of the 
remaining mutants (4,445 strains) were normalized to the growth on 
unsupplemented TSAC and log 2 transformed. Complete results are pre- 
sented in Data Set SI. 

For each condition, growth values were centered near a log 2 of 0 with 
a conspicuous tail corresponding to mutants with decreased growth 
(Fig. 5B). To identify mutants with significantly decreased or increased 
growth, we assumed that the growth values in the peaks centered at a log 2 
of 0 were approximately normally distributed (Fig. 5B). Strains whose 
growth differed significantly from the norm (P value of <0.01 or >0.99) 
were considered candidates for the corresponding mutant phenotypes 
(see Table S2 in the supplemental material). Cases in which at least two 
different mutant alleles of a gene led to a phenotype defined confirmed 
genotype-phenotype associations (see Data Set S2). Cases in which only 
one mutation in a gene led to a phenotype were disregarded unless it was 
the sole mutant allele examined for the gene; these cases defined uncon- 
firmed genotype-phenotype associations and were included in Data 
Set S2. 
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